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CENPA/Cse4 assembles centromeric chromatin on diverse DNA. CENPA chromatin is epigenetically propagated on 
unique and different centromere DNA sequences in a pathogenic yeast Candida albicans. Formation of neocentromeres on 
DNA, nonhomologous to native centromeres, indicates a role of non-DNA sequence determinants in CENPA deposition. 
Neocentromeres have been shown to form at multiple loci in C albicans yN\\en a native centromere was deleted. However, the 
process of site selection for CENPA deposition on native or neocentromeres in the absence of defined DNA sequences 
remains elusive. By systematic deletion of CENPA chromatin-containing regions of variable length of different chromo- 
somes, followed by mapping of neocentromere loci in C albicans and its related species Candida dubliniensis, which share 
similar centromere properties, we demonstrate that the chromosomal location is an evolutionarily conserved primary 
determinant of CENPA deposition. Neocentromeres on the altered chromosome are always formed close to the site which 
was once occupied by the native centromere. Interestingly, repositioning of CENPA chromatin from the neocentromere to 
the native centromere occurs by gene conversion in C albicans. 

[Supplemental material is available for this article.] 



A functional centromere, the chromosomal site where a kineto- 
chore assembles and attaches to spindle microtubules to facilitate 
proper chromosome segregation, is formed only once on each 
monocentric chromosome in normal cells. Occasionally, dicentric 
chromosomes are formed in a diseased state such as in cancerous 
cells that show high genome instability (Shen 2011). The mecha- 
nism that assures the presence of only one functional centromere 
in a chromosome is largely unknown. While centromeres perform 
an indispensable function in chromosome segregation, underlying 
centromeric sequences are highly divergent (Henikoff et al. 2001). 
The presence of CENPA, a variant of histone H3, at centromeric 
chromatin is a conserved feature of all eukaryotic chromosomes 
(Stimpson and Sullivan 2010; Black and Cleveland 2011). How- 
ever, the mechanism of CENPA recruitment in yeast to humans 
may not be universal (Sanyal 2012). 

A landmark discovery of neocentromere formation on a locus 
nonhomologous to the native centromere in an altered "acentric" 
chromosome found in a boy with a learning disability provided 
strong evidence that non-DNA sequence determinants can play 
a role in centromere specification (Voullaire et al. 1993). Since the 
discovery of the first human neocentromere, several neocentro- 
meres have been identified in humans, and the occurrence of 
neocentromeres has been reported in fungal, animal, and plant 
species as well (Marshall et al. 2008; Marshall and Choo 2009). 
Human neocentromeres also contain most other proteins required 
for proper kinetochore formation (Alonso et al. 2007). Thus neo- 
centromere formation provides an excellent model for the study of 
the determinants of CENPA deposition that forms centromeric 
chromatin. 
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Neocentromere formation on a non-native locus takes place 
only in the absence of the functional native centromere in a nor- 
mal cell (Ishii et al. 2008; Ketel et al. 2009). The mechanism that 
suppresses neocentromere loci in the presence of the functional 
native centromere remains largely unknown. In contrast to short 
point centromeres of budding yeast Saccharomyces cerevisiae that 
are genetically regulated, repetitive regional centromeres of most 
other eukaryotes including fission yeast Schizosaccharomyces 
pombe, Drosophila, plants, and humans are epigenetically regulated 
(Black and Cleveland 2011). Many epigenetic determinants of 
centromere function such as RNA interference, H3K9Me2-marked 
pericentric heterochromatin, and DNA methylation have been 
identified in several organisms (Lachner et al. 2001; Nakayama 
et al. 2001; Wong et al. 2006; Folco et al. 2008; Gopalakrishnan 
et al. 2009; Roy and Sanyal 2011). Two closely related human path- 
ogenic budding yeasts, Candida albicans and Candida dubliniensis, 
carry 3- to 5-kb-long unique centromeres that lack specific sequence 
motifs, characteristics of point centromeres, as well as features as- 
sociated with regional centromeres such as repeat elements, DNA 
methylation, H3K9Me2-mediated heterochromatin, or CENP-B 
proteins (Sanyal and Carbon 2002; Sanyal et al. 2004; Baum et al. 
2006; Padmanabhan et al. 2008). These short regional centromeres 
of C. albicans have been shown to be regulated epigenetically 
(Baum et al. 2006). However, the nature of epigenetic factors re- 
mains to be determined. Neocentromere formation at multiple 
ectopic loci has been shown to be highly efficient in C. albicans 
(Ketel et al. 2009). Thus, the C. albicans neocentromere provides an 
attractive system for the study of enigmatic epigenetic regulatory 
factors involved in formation of CENPA chromatin. 

More interestingly, centromeres of C. albicans have under- 
gone rapid divergence from C. dubliniensis (they diverged from 
each other —20 million years ago) (Padmanabhan et al. 2008). Such 
rapid evolution of centromeres reveals a remarkable plasticity for 
DNA sequences in centromere function. A similar divergence in 
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centromeric sequences has been reported in closely related fungal 
{Schizosaccahromyces sp.), plant (Oryza sativa and O. brachyantha), 
and animal (Muscaroli and other mouse species) species, and even 
in closely related lineages of the Saccharomyces paradoxus (Kipling 
et al. 1995; Lee et al. 2005; Bensasson et al. 2008; Rhind et al. 201 1). 
Asymmetric female meiosis has been proposed to be the drive for 
selfish centromeres to gain a transmission advantage that leads to 
rapid centromeric evolution (Henikoff et al. 2001). However, the 
process responsible for the rapid evolution of CENPA-binding 
regions in lineages of the S. paradoxus with symmetric meio- 
sis (Bensasson et al. 2008) and closely related C. albicans and 
C. dubliniensis which lack any detectable meiotic cycle remains 
unknown. 

In the present study, a comprehensive analysis of neo- 
centromere formation identified physical chromosomal location 
as a novel epigenetic factor that regulates CENPA deposition at 
Candida centromeres. A comparative analysis revealed that cen- 
tromere location and the dynamics of neocentromere formation 
are evolutionarily conserved in C. albicans and C. dubliniensis. We 
demonstrate that the physical chromosomal location not only 
ensures CENPA deposition at only one locus, but also prevents its 
deposition at other potential centromere-forming loci in Candida. 
Most strikingly, we observed that gene conversion occurs at 
C. albicans centromeres. We speculate that gene conversion may 
contribute to both rapid evolution of centromere DNA and con- 
servation of centromeric location in the same and other closely 
related species. 

Results 

Relative chromosomal position of CENPA-binding regions 
in C albicans and C dubliniensis remains evolutionarily 
conserved in spite of extensive cliromosomal rearrangements 

Two closely related species of Candida, C. albicans and C. dubliniensis, 
share a high degree of DNA sequence homology as they diverged 
—20 million years ago (Mishra et al. 2007). Centromere DNA se- 
quences of C. albicans and C. dubliniensis share no sequence ho- 
mology due to rapid evolution of centromere DNA sequences 
(Padmanabhan et al. 2008), although synteny of genes across 
CENPA-binding regions in most of the chromosomes is largely 
maintained in these two species. An Sfil macrorestriction map is 
now available for both C. albicans (1006 and white opaque switch- 
ing strain WO-1; Chu et al. 1993) and C. dubliniensis (Strain Cd36; 
Magee et al. 2008). C. albicans WO-1 (http://www.broadinstitute. 
org/ annotation/ genome/ candida_albicans/ GenomesIndex.html? 
component=%24DirectLink&service=direct&session=T&sp=SCAWl) 
and C. dubliniensis (http://www.sanger.ac.uk/resources/downloads/ 
fungi/candida-dubliniensis.html) have undergone various gross 
chromosomal rearrangements during the course of evolution, re- 
sulting in the rearrangement of Sfil fragments between different 
chromosomes (Magee and Magee 1997; Jackson et al. 2009). We 
aligned and compared the relative arrangement of Sfil fragments 
on orthologous chromosomes of C. albicans 1006, C. albicans WO-1, 
and C. dubliniensis (Supplemental Fig. SI). Despite the nature of the 
rearrangements that occurred being very different, orthologous 
centromeres of all the chromosomes involved in these rearrange- 
ment events are located at a similar distance from one end of the 
chromosome in two clinical isolates of C. albicans 1006 and WO-1 
(Mishra et al. 2007) and C. dubliniensis (Padmanabhan et al. 2008). 
Thus, in spite of having dissimilar centromere DNA sequences, 
a high degree of conservation in centromere location suggests 



involvement of chromosomal locations in determining centro- 
meres' identity in Candida. 

Formation of centromeric chromatin on a DNA sequence 
is context-dependent 

To test if a specific chromosomal location per se, irrespective of 
the DNA sequence present, could be an epigenetic determinant for 
centromere formation in Candida, a 1.4-kb URA3 gene sequence 
was integrated into CEN7 (coordinates: Assembly 21 CaChr7, 
427233) in the C. albicans RMIOOOAH strain (Fig. lA; Sanyal et al. 
2004), and the expected integration was verified by Southern 
analysis (Supplemental Fig. S2A). Each homolog of chromosome 7 
(Chr7) is marked with a unique marker ARG4 or HISl at the same 
location, unlinked to CEN7 and present —450 kb away from CEN7 
on the right arm of Chr7 in RMIOOOAH. The frequency of chro- 
mosome loss was monitored in transformants carrying URA3 in- 
tegrated at CEN7 by measuring the simultaneous loss of two 
markers: ARG4 and URA3 or HISl and URA3. We did not detect any 
chromosome loss (URA loss <1 in 10^ cells) (Table 1) in such inte- 
grants, suggesting that the [/iM3-containing altered chromosome 
is mitotically stable. Subsequently, karyotypic analysis eliminated 
the possibility of chromosomal rearrangements in those RMIOOOAH 
{CEN7/CEN7::URA3) transformants (Fig. IB). Genomic DNA plugs 
prepared from five URA3 integrants were resolved on CHEF gels 
using conditions that separate all the chromosomes of C. albicans 
(see Methods), transferred to a membrane and probed with URA3 
or CEN7. None of these integrants showed any observable chro- 
mosomal rearrangements (Fig. IB). Interestingly, all the integrants 
{CEN7/CEN7::URA3) exhibited reversible silencing of URA3 as 
evidenced by their growth both on 5-fluoro-orotidic acid (FOA) 
and complete medium lacking uridine (CM-Ura) plates (Supple- 
mental Fig. S2B, left). Transcript levels were measured in one such 
integrant (J151) grown both on FOA and CM-Ura. Reverse tran- 
scriptase qPCR analysis revealed approximately fivefold repression 
in URA3 transcript levels on FOA as compared with CM-Ura (Sup- 
plemental Fig. S2B, right). It is important to note that C. albicans 
is a diploid organism, and thus one Chr7 homolog is unaltered 
while the other one has URA3 integrated at CEN7. Both ChlP-PCR 
and ChlP-qPCR analysis on CENPA/Cse4 ChIP DNA revealed that 
CENPA was recruited on URA3 of the altered homolog and on 
CEN7 of the unaltered homolog (Fig. 1C,D). These results strongly 
suggest that centromeric chromatin can assemble on noncentro- 
meric DNA when placed at the native centromere, without dis- 
rupting functional centromere formation at the native locus in 
C. albicans. 

Replacement of the core CENPA-rich native centromere 
by URA3 is sufficient to induce centromeric chromatin 
relocation but depends on the transcriptional status of URA3 

To examine whether CENPA-containing centromeric chromatin 
can also assemble on URA3 in the total absence of the core CENPA- 
rich region, we replaced the core 4.5-kb CENPA-rich CEN7 region 
(coordinates: Assembly 21 CaChr7, 424438-428994) by the 1.4-kb 
URA3 sequence in RMIOOOAH {CEN7/CEN7) (Supplemental Fig. 
S3 A). We performed 15 independent transformation experiments. 
To ensure the independent nature of each transformant, only one 
correct transformant screened by Southern blot analysis (the 
strategy is described in the Supplemental Methods) was selected 
from each transformation experiment for subsequent analysis. The 
transformants with desired CEN7 deletion were further analyzed 
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Figure 1. CENPA chromatin assembles at a specific chromosomal location. (A) Schematic showing the location of URA3 insertion at native CEN7 in 
one homolog of Chr7. Arrows indicate the gene location and the direction of transcription while the numbers represent coordinates on a specific 
chromosome based on Assembly 21 of C albicans SC531 4 genome. (B) An ethidium bromide (EtBr) stained CHEF gel image (left) showing no apparent 
alteration in the karyotype of RM1 000AH-CEN7 ::URA3 integrants (lanes 7-5) as compared with the parent RMl OOOAH (lane P). DNA was transferred 
on a membrane and Southern analysis was performed by probing the blot with either CaCfN/ {middle) or URA3 (right). (C) ChIP analysis with anti- 
Cse4 (CENPA) antibodies followed by PCR revealed recruitment of CENPA on URA3 of the altered homolog (left) as well as on native CEN7 of the 
unaltered homolog (right) of Chr7. Primer locations are marked with bars. LEU2 present on Chr7 but unlinked to CEN7 is used as a negative control for 
CENPA binding. (T) Total DNA; (+) with Ab; (-) without Ab (beads only). (D) Real-time qPCR analysis on CENPA ChIP DNA from RMIOOOAH- 
CEN7 ::URA3/CEN7 strain J154 using primers from CfN7 and URA3 regions. Fold enrichment of CENPA binding was calculated as described in 
Methods. 



by CHEF gel followed by Southern analysis as described above. 
One out of 15 transformants showed chromosomal rearrange- 
ments of C£N7-deleted Chr7 (Supplemental Fig. S3B) and was 
excluded from further studies. One of the remaining 14 trans- 
formants exhibited a high rate of loss of Chr7 (URA loss 20%) 
and was also excluded from further analysis. In all other trans- 
formants, CEN7-deleted altered Chr7 was mitotically stable (URA 
loss <1 in 10^ cells) (Table 1). To find out whether or not cen- 
tromeric chromatin assembled on URA3 that replaced native 
CEN DNA, CENPA/Cse4 ChIP assays were performed in 11 stable 
RMlOOOAH-cen/A {CEN7/cen7A) transformants. In 10 out of 11 
RMlOOOAH-cen/A transformants, PCR analysis using primers 
from different regions of URA3 revealed CENPA binding on one 
end of URA3. CENPA binding was found to be extended beyond 
URA3 until 2-3 kb toward the right arm of Chr7 spanning 
Orfl9.6520 and Orfl9.6522 {nCEN7-I) (Fig. 2; Supplemental Fig. 
S3C). Thus, a deletion spanning only a CENPA-rich region acti- 
vates neocentromeres at a location situated next to the deleted 
region. When ChIP assays were performed on cells grown in FOA 
media, CENPA binding showed a shift away from Orfl9.6520 
and Orfl9.6522 and encompassed the entire URA3 region, sug- 
gesting that transcriptional silencing favors CENPA binding in 
C. albicans. 



The physical chromosomal location of CENPA chromatin 
is an important determinant of centromere identity in Candida 

Detailed PCR analysis on CENPA/Cse4 ChIP DNA from 11 
RMlOOOAH-cen/A transformants, each of which carries a neocen- 
tromere-containing altered Chr7, using a set of primers covering 
a 20-kb region spanning CEN7, revealed that in addition to the 
native centromere of unaltered Chr7, a second neocentromere 
site {nCEN7-II, 1/11 transformant) was activated 3 kb away from 
URA3 on the left arm of the altered Chr7 spanning Orfl9.6526 
and Orfl9.6525 (Fig. 2). To confirm the binding of another evolu- 
tionarily conserved kinetochore protein CENPC1/Mif2 on nCEN7-I 
and nCEN7-II, we functionally expressed Myc-tagged CENPC1/Mif2 
(Methods) in RMl OOOAH -cen7A strains that carried a neocentric 
Chr7 (Supplemental Fig. S3D). PCR analysis on Myc (CENPC1/Mif2) 
Chip DNA revealed CENPCl enrichment on nC£N7-/ and nCEN7-II 
consistent with CENPA/Cse4 binding (Fig. 2). The length of the 
CENPA-rich region on a neocentromere in each case was 3-4 kb 
including -1-2 kb of variation among transformants within 
a specific class of neocentromeres. If neocentromere formation 
occurs randomly anywhere on Chr7 (—950 kb), the chance of 
its formation adjacent to a native centromere would be ~1 in 300. 
Since all the transformants analyzed in this study formed 
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Table 1. Chromosome loss frequency in various CfN-deleted 
transformants 
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neocentromeres adjacent to the native centromeres, we conclude 
that centromere proximal regions are the most preferred and thus 
biased for assembly of CENPA chromatin in the absence of the 
native core CEN7 sequence. 

Neocentromere formation at centromere proximal regions 
is independent of the size of the pericentric region deleted 

Pericentric regions that consist of inverted repeats and boundary 
elements facilitate CENPA deposition at 10- to 15-kb central do- 
mains (Takahashi et al. 2000; Allshire and Karpen 2008; Roy and 
Sanyal 2011) of the 5. pombe centromere. In the absence of any 
boundary elements (such as tRNA genes in S. pombe; Kuhn et al. 
1991; Takahashi et al. 1991), H3K9 methylation (found in peri- 
centric chromatin of most regional centromere-containing organ- 
isms; Bernard et al. 2001; Nakayama et al. 2001; Rice et al. 2003) or 
common repeat elements (Fishel et al. 1988), the length of the 
pericentric region is difficult to ascertain in C. albicans (Sanyal 
2012). Detailed bioinformatic analysis to find out pericentric re- 
peats in our previous study (Padmanabhan et al. 2008) revealed 



that pericentric regions surrounding all the centromeres except 
CEN5 lack proper repeats and contain only cryptic (or remnants of) 
transposable elements or inverted repeats. Interestingly, degener- 
ate retrotransposon-like repeat elements (e.g.. Teen, Tgll, andTgl2) 
are features of Neurospora erassa centromeres as well (Cambareri 
et al. 1998; Smith et al. 2011). Orthologous pericentric regions 
contain several short stretches of DNA sequences that are com- 
mon in pericentric regions of some, but not all, C. albicans and 
C. dubliniensis chromosomes (Padmanabhan et al. 2008). The for- 
mation of neocentromeres almost invariably at centromere prox- 
imal locations prompted us to hypothesize that, in the absence of 
specific sequence motifs or repeats, the more conserved pericentric 
regions may facilitate recruitment of CENPA at these regions. We 
anticipated that residual CENPA present on CEN7 proximal re- 
gions may be responsible for seeding assembly of CENPA chro- 
matin in these regions for neocentromere formation in the absence 
of the native centromere DNA. 

To examine whether or not neocentromeres are formed 
randomly at multiple locations in the absence of pericentric re- 
gions, we replaced the entire 6.5-kb ORF-free region including 
CEN7 (coordinates: Assembly 21 CaChr7, 423450-429852) with 
the 1.4-kb URA3 sequence (Supplemental Fig. S4A, left). Five cor- 
rect RM1000AH-cen7-6.5/cM transformants (Supplemental Fig. 
S4B, left) analyzed (each obtained from an independent trans- 
formation experiment) showed neither any rearrangement of 
Chr7 (Supplemental Fig. S4C, left) nor a significant chromosome 
loss (Table 1), suggesting neocentromere formation in each case. 
CENPA/Cse4 ChIP assays were performed in four RMIOOOAH- 
een7-6.SkbA transformants to map neocentromere sites. The most 
prevalent site of neocentromere formation was mapped to the 
same hotspot identified when the CEN7 core was deleted {nCEN7-T). 
A new site {nCEN7-II1) was mapped 13 kb away from the CEN7 
sequence toward the right arm of Chr7 on an ORF-free region 
(Assembly 21, CaChr7 Coordinates 442000-445000) (Fig. 2A; 
Supplemental Fig. S4D). The nCEN/-/// neocentromere location 
was further confirmed by co-localization of CENPA and CENPCl 
binding on the neocentric chromosome as described above 
(Fig. 2A). 

Attempts to construct a functional minichromosome in 
C. albicans have been unsuccessful so far; hence the precise bound- 
ary of a functional centromere or the pericentric region is undefined 
(Baum et al. 2006). It is possible that the pericentromeric module 
in C. albicans is longer than the ORF-free 6.5-kb region surround- 
ing CEN7. Thus subsequently we deleted a 10-kb (data not shown) 
or a 30-kb region including CEN7 (coordinates: Assembly 21 
CaChr7, 411320-440780) in RMIOOOAH (Supplemental Fig. S4, 
right panels). Due to the deletion of the 30-kb region, it was pos- 
sible to separate the unaltered and altered Chr7 in RMIOOOAH- 
cen7-30kbA on a CHEF gel using a specific run condition (data not 
shown). We selected those independent RM1000AH-cen7-30kbA 
clones that were stable {URA loss <1 in 10^ cells) (Table 1) and the 
altered chromosome moved faster on the CHEF gel (Supplemen- 
tal Fig. S4C). The CENPA/Cse4 ChIP followed by PCR analysis 
using primers from Chr7 revealed that neocentromeres formed at 
a site adjacent to the deleted region on Orfl9.6531, Orfl9.6532, 
Orfl9.6533, Orfl9.6534 {nCEN7-IV, 1-2 kb away from URA3 to- 
ward the left arm of Chr7) on the altered chromosome, in addi- 
tion to the native CEN7 of the unaltered Chr7 homolog in both 
the transformants (Fig. 2A; Supplemental Fig. S4D). To confirm 
the presence of a functional kinetochore at nCEN7-VI, we func- 
tionally expressed Myc-tagged CENPC1/Mif2 (see Methods) in 
RM1000AH-cen7-30kbA (J 191) strains that carried a neocentric 



Genome Research 641 

www.genome.org 



Thakur and Sanyal 



RMIOOOAH 



J 161 CENPA 
J176 CENPCl 



J 164 CENPA 
J178 CENPCl 



J181 CENPA 
J186 CENPCl 



J191 CENPA 
J193 CENPCl 



6515 
6517 
6518 

6520 
6522 
428994 I 

424438 I 

6524 
6525 
6526 

6527 

6528 
6529 



J CENPCl 

1 CENPA 



T -I- - 



1 kb 



i4.5kb 



i4.5kb ^ 



T + 



^3 



13 kb 



6528 
6529 



CD 

64195 A 

708o|i-£W2 m- 

Non-CENCD ^ 

CaChr7 assembly 21 




T + - 



3kb 



6.5 kb 



6527 



6528 { 

6529 4 



B 



J161 



CaChr7 assembly 21 



J 164 



7080 f LEU2 - 
CaChr7 assembly 21 



6528 { 

6529 i 



7080 f 



30 kb 



2kb l» — 



I LEU2 ^ 



CaChr7 assembly 21 



J181 



CaChr7 assembly 21 



J191 




CEN7 EESai LEU2 



CEN7 LEU2 



CEN7 



LEU2 



CEN7 ESSHEI LEU2 



Figure 2. Neocentromeres, like centromeres, preferentially form on specific chromosomal locations in a nonrandom fashion independent of the length of 
the deleted region in C albicans. (A) CENPA and CENPCl ChIP analyses in RMl OOOAH-cen/Zi, RMl 000AH-cen7-6.5 kbA, or RMl OOOAH-cen 7-3 0/cM clones 
mapped neocentromeres (nCENs) within 1 5 kb from the deleted region. C. albicans is a diploid yeast and carries two homologs of each chromosome. Only one 
homolog of Chr7 where CfN7 has been replaced by URA3 is shown. CENPA enrichment shown at the native CfN7 location is contributed by an unaltered 
homolog and is shown as a positive control for CENPA ChIP assays. Thick black arrows along with the Orf numbers show the gene arrangement and the direction 
of transcription. One representative clone from each class (see Supplemental Figs. S3C, S4B for other clones) is shown. Class I neocentromeres (nCfN7-/) formed 
on Orfl 9.6520 and Orfl 9.6522 (10/11 RMl OOOAH-cen 7zi and 4/5 RM1000AH-cen-6.5/cM transform ants), Class II neocentromeres {nCEN7-ll) mapped to 
Orfl 9.6526 and Orfl 9.6525 (1 /1 1 RMl 000AH-cen7zi), Class III neocentromeres {nCEN7-lll) mapped on an Orf-free region— Assembly 21 , CaChr7 Coordinates 
442000-445000 (1 /5 RMl 000AH-cen-6.5/cM transformant)— and nCEN7-IV mapped on Orfl 9.6531 , Orfl 9.6532, Orfl 9.6533, Orfl 9.6534 (2/2 RMl OOOAH- 
cen7-30kbA transformants). Parent RMl OOOAH did not show CENPA or CENPCl enrichment on these neocentromere sites. Coordinates of the control non-CEN 
(LEU2) region on Chr7 in Assembly 21 are 641 95-64440 (Orf 1 9.7080). (T) Total DNA; (+) with Ab; (-) without Ab (beads only). CENPA ChIP profiles are shown 
in red boxes and the same for CENPCl are shown in green boxes. Enrichment of CENPA at the URA3 location was not shown here but presented as a separate 
figure (Supplemental Fig. S5A). (B) CENPA ChIP qPCR analysis showing fold enrichment of CENPA on native and neocentromeres. 



Chr7. CENPCl ChIP assays in resulting strain J193 confirmed the 
co-localization with CENPA at nCEN7-VI (Fig. 2A). Together we 
conclude that neocentromeres always mapped to the centromere 
proximal regions when the core CENPA-rich region is deleted, 
irrespective of the length of the deleted region. Thus, the physical 
location, rather than the pericentric chromatin state, is a deter- 
minant of centromere/neocentromere identity in C. albicans. 

Next we compared the strength of CENPA binding on native 
centromeres and neocentromeres by real-time qPCR analysis on 
CENPA Chip DNA from neocentromere-containing strains. Real- 
time qPCR analysis revealed that native CEN7 is strongly enriched 



as compared with neocentromeres (Fig. 2B). Neocentromeres 
formed on the most prevalent site, nCEN7-I, which is closest to the 
native CEN7 location, showed —85% CENPA binding as compared 
with native CEN7, while neocentromeres nCEN7-II and nCEN7-IV 
that are formed 3-4 kb away from the deleted region showed 
—60% and —70% CENPA binding, respectively, as compared with 
that of the native centromeres {CEN7). As the distance of a neo- 
centromere site from the deleted CEN7 region increased, CENPA 
binding progressively dropped and CENPA binding to nCEN7-III, 
which is located 13 kb away from the native centromere, showed 
—35% as compared with native CEN7 (Fig. 2B). 
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CENPA Chip sequencing analysis confirms that CEN proximal 
neocentromeres are the unique enriched regions on the entire 
altered chromosome 

Since CENPA binding on nCEN7-II, nCEN7-III, and nCEN 7 -IV was 
significantly less (ranging from 35% to 70%) than the native CEN 7, 
we wondered whether or not these sites are the only CENPA-rich 
sites present across the entire altered Chr7. In order to verify this 
we performed ChIP sequencing experiments (see Methods) on the 
CENPA/Cse4 or CENPCl/MycMif2 ChIP DNA fractions from 
transformants carrying active neocentric {nCEN7-II, nCEN7-III, 
and nCEN7-IV) chromosomes. All the sequence reads aligned to 
native CEN7, nCEN7-II, nCEN7-III, and nCEN7-IV sites over a region 
of 3-5 kb length (GEO accession number GSE42907). Exclusive 
enrichment of CENPA at native CEN7 (of unaltered Chr7 homo- 
log) and the neocentromeres that we mapped (Fig. 3) excluded 
the possibility of the presence of ectopic CENPA-binding sites. 
Thus centromere-proximal CENPA-rich sites identified in this 
study are authentic neocentromeres. 

To examine if the dynamics of neocentromere formation is 
chromosome-specific, we extended our study to Chrl. The 4.2-kb 
CENPA/Cse4-binding region on Chrl (coordinates: Assembly 21 
CaChrl, 1562878-1567085) was replaced with the 1.4-kb URA3 
sequence in RMIOOOAH. CENPA/Cse4 ChIP assays were performed 
in two transformants harboring stable [/iM3-containing Chrl to 
map neocentromere locations. Again, both the neocentromeres 
formed at CENl proximal regions (coordinates: Assembly 21, CaChrl 
1550000-1555000) (Fig. 4A), suggesting that similar to Chr7, CEN 
proximal regions are the most preferred sites of neocentromere 
formation on Chrl as well as in C. albicans. 

In contrast to our study where neocentromeres always formed 
on centromere proximal regions, a previous study had shown that 
neocentromere formation in C. albicans takes place at multiple loci 
on Chr5 even at a distance as far as 450 kb from the deleted CENS 
(Ketel et al. 2009). CENS is the only C. albicans centromere that 
contains a core CENPA-rich region surrounded by long inverted 
repeats. To reexamine whether the pattern of neocentromere for- 
mation is different or not on Chr5, we replaced the entire CENS 
spanning the core and the repeats (coordinates: Assembly 21, 
CaChr5 466279-473499) by the same 1.4-kb URA3 gene. We 
expressed Prot A- tagged CENPA in wild-type RMIOOOAH as well 
as in six randomly selected RMlOOOAH-cenSA transformants that 
exhibited no loss of the altered chromosome. ProtA (CENPA) ChlP- 
PCR analysis showed that CENPA was enriched at the native cen- 
tromere of the unaltered Chr7 as well as on C£N-proximal regions 
(nCENS-I: coordinates: Assembly 21, CaChr5 456000-462000; 
nCENS-II: coordinates: Assembly 21, CaChr5 474000-482000) 
(Fig. 4B) to form neocentromeres. We observed that the length of 
neocentromeres was longer than native centromeres (3-5 kb) in 
Chr5 and Chrl deletion experiments ranging from 6 to 12 kb (see 
the coordinates given above). These results, taken together, strongly 
suggest that CEN-proximal sites are the most preferred sites for 
centromere seeding in C. albicans. Real-time qPCR analysis re- 
vealed significant reduction in CENPA binding on neocentromeres 
formed on Chrl and Chr5 as compared with their native coun- 
terparts (Fig. 4C). The presence of no other CENPA-binding region 
on these chromosomes was confirmed by ChlP-seq analysis on 
the CENPA/Cse4 or ProtA-Cse4 ChIP DNA fractions from trans- 
formants carrying neocentromeres (Fig. 5; GEO accession number 
GSE42907). 

Similarly to neocentromeres formed on Chr7, an additional 
CENPA-binding region apart from the native CEN confirms 



neocentromere sites on Chr5 identified by ChlP-PCR and ChlP- 
qPCR analysis. Interestingly, as reported in a previous study, a shift 
of CENPA binding from the original CENS proximal neocentric 
location to URA3 was observed when the strain was grown in FOA- 
containing media for ChIP analysis (Supplemental Fig. S5A,B). 

The mechanism of neocentromere formation remains 
conserved in C albicans and C dubliniensis 

Since the physical chromosomal location of centromeres is con- 
served in C. albicans and C. dubliniensis, we next examined the 
pattern of neocentromere formation in C. dubliniensis. We con- 
structed a URA auxotrophic strain CdJ3 by transforming Cd36 
using the MPA flipper cassette (Staib et al. 2001; see Methods). Next 
we replaced the CENPA-binding region of Chr7 (coordinates: 
CdChr7 434001-440600) of C. dubliniensis with the 1.4-kb URA3 
sequence in the CdJ3 strain (Supplemental Fig. S6A). Replacement 
of CEN7 in one homolog of Chr7 in five independent trans- 
formants, each one obtained from an independent transformation 
experiment, was confirmed by PGR. Like C. albicans, the CdCENZ- 
deleted chromosome remains mitotically stable in each case {URA3 
loss <1 in 10^ cells) (Table 1). Of the six C. dubliniensis strains an- 
alyzed, no two showed identical karyotypes in a previous study 
(Magee et al. 2008). Thus, it is possible that CdC£N7-deleted chro- 
mosomes might have been fused to a natural centric chromosome 
rather than forming a neocentromere to become mitotically stable 
as frequently found in fission yeast (Ishii et al. 2008). Karyotypic 
analysis in five such C. dubliniensis transformants carrying a stable 
altered Chr7 exhibited no chromosomal rearrangements of URA3- 
containing CENZ-deleted Chr7 (Supplemental Fig. S6B), suggesting 
that neocentromere formation is highly efficient in C. dubliniensis 
as well. However, Southern analysis of the CHEF gel probed with the 
C6.CEN7 sequence suggests additional rearrangements of chromo- 
somes. Thus, we performed locus-specific Southern analysis of these 
clones to verify replacement of CdCEN7 with URA3. Only two 
Cd]3-cen7A transformants (J202 and J204) that exhibited desired 
bands, when EcoRV-digested DNA was probed with a sequence 
adjacent to CEN 7, were taken further for mapping neocentromere 
locations (Supplemental Fig. S6C). We expressed Prot A-tagged 
CENPCl (CdMif2-Prot A) in two strains, J202 and J204, and per- 
formed Prot A (CENPCl) -Chip assays to identify neocentromere 
locations (Supplemental Fig. S6D) in resulting strains J212 and 
J216, respectively. Neocentromere locations were mapped be- 
tween 1 and 10 kb away from the deleted CdC£N7 on the altered 
chromosome in each case (Supplemental Fig. S6E). Thus, similar 
to C. albicans, centromere proximal regions are the preferred sites 
of neocentromere formation in C. dubliniensis also. 

Gene conversion occurs at C albicans centromeres 

Replacement of chromosomal regions of various lengths (4.5-30 kb) 
spanning CEN7 hy URA3 in C. albicans leads to stable propagation of 
the [/iM3-containing neocentric chromosome. During fluctuation 
assays to measure chromosome loss, some of the Ura~ derivatives of 
neocentromere harboring strains J164 (RMlOOOAH-cen/A) andJlSl 
(RM1000AH-cen7-6.5/cM) exhibited an unusual genotype: These 
segregants were prototroph for Arg and His markers indicating 
the presence of both the Chr7 homologs (Fig. 6A). These Ura~ and 
FOA"^ (resistant to FOA) segregants could not grow on CM-Ura 
media (Fig. 6B,C), suggesting that the URA3 gene was not re- 
versibly silenced. Southern analysis of the above-mentioned Ura~ 
colonies revealed the absence of the URA3 gene containing 7.6-kb 
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Figure 3. ChlP-seq analyses revealed that neocentromeres are always and exclusively formed at a close proximity to the native CEN. Relative number of 
sequence reads from the whole cell lysate (upper panels) or CENPA-Prot A or MycMif2/CENPCl ChIP DNA (lower panels) in wild-type strain J200, Cen7A- 
deleted strains Jl 78 (RMl OOOAH-cen/Zi), Jl 81 (RMl 000 AH- cen 7- 6. 5 kbA), or Jl 91 (RMl 000AH-cen7-30kbA) mapped neocentromeres (nCENs) within 1 5 
kb from the deleted region. (Left panels) Distribution of sequence reads across the entire chr7; (right panels) zoomed-in view of neocentromeric location 
with respect to the native CEN7 in each case. Chromosome coordinates are shown in the x-axis. 



Bglll fragment that was originally present in their parent 
RM1000AH-cen7A (Supplemental Fig. S7A). CHEF gel analysis in 
these Ura~ colonies revealed no detectable genomic rearrangement 
as compared with wild-type RMIOOOAH and parent RMIOOOAH- 
cen7A (Supplemental Fig. S7B). The presence of both the homologs 



of Chr7 and the absence of the [/RA^-containing Bglll fragment 
confirmed that URA3 was nonreciprocally exchanged by the CEN7 
sequence of the unaltered homolog of Chr7 through gene conver- 
sion. We detected similar gene conversion events in RMl 000 AH- 
cen7-6.5kbA transformants as well (Supplemental Fig. S7, right 
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Figure 4. Neocentromeres are preferentially formed on CEN adjacent regions on various chromosomes of C albicans. {A) CENPA ChIP analyses in 
RM1 OOOkH-cenl A clones mapped neocentromeres (nCENs) within 1 5 kb from the deleted region. (S) CENPA ChIP analyses in RM1 OOOAH-cenizi clones 
also mapped neocentromeres (nCENs) within 1 5 kb from the deleted region. C albicans carries two homologs of each chromosome. Only one homolog of 
Chrl where CENl has been replaced by URA3 is shown. CENPA enrichment shown at the native CfNs location is contributed by an unaltered homolog and 
is shown as a positive control for CENPA ChIP assays. Black arrows along with the Orf numbers show the gene arrangement and the direction of tran- 
scription. Red thick arrows on either side of CEN5 indicate inverted repeats. Neocentromeres on both chrl {nCENl-l: coordinates: Assembly 21, CaChrl 
1544000-1556000) and Chr5 (nCEN5-l: coordinates: Assembly 21, CaChr5 456000-462000; nCEN5-ll: coordinates: Assembly 21, CaChr5 474000- 
482000) formed at CENl proximal regions. Length of CENPA binding on Chrl and Chr5 neocentromeres was longer (4-1 2 kb) than native centromeres 
(3-4 kb). Parent RMl OOOAH did not show CENPA enrichment on these neocentromere sites. (T) Total DNA; (+) with Ab; (-) without Ab (beads only). 
(C) CENPA Chip qPCR analysis showing fold enrichment of CENPA on native centromeres and neocentromeres as compared with a noncentromeric re- 
gion (LEU2). Enrichment of CENPA at the URA3 location was not shown here but presented as a separate figure (Supplemental Figure S5). 



panels). Thus we conclude that gene conversion occurs at C. albicans 
centromeres. 

Centromere repositioning occurs during gene conversion 

Strains J164 (RMlOOOAH-cenZl) andJlSl (RMIOOO AH-cen7-6.5kbA) 
that exhibited gene conversion contained active neocentromeres 
on nCEN7-II (~3 kb away from CEN7 toward the left arm of Chr7) 
and nCEN7-III (—13 kb away from CEN 7 toward the right arm of 
Chr7), respectively (Figs. 2, 3). If both the acquired CEN7 and the 
preexisting neocentromere in gene-converted derivatives are 
active in each case, Chr7 could become dicentric. We performed 
Chip assays in gene-converted derivatives but no enrichment of 
CENPA/CENPCl on neocentromeric location was observed (Fig. 7). 
Together these results suggest that neocentromeres previously 
established in parent RMl OOOAH -cen/Zi and RM1000AH-cen7- 
6.5kbA strains have been inactivated in their gene-converted 
derivatives which have acquired the native centromeric region 
from the unaltered Chr7 homolog. Thus, the active centromere 
has been repositioned (from the neocentromere location) to its 
original location. 

In meiosis, gene conversion tracts range from 1 to 2 kb (Borts 
and Haber 1989; Detloff and Petes 1992; Malone et al. 1992), while 
in mitosis, some gene conversion tracts range up to hundreds of 
kilobases (Judd and Petes 1988). Since gene conversion at C. albicans 



CEN7 described here is mitotic in nature, it can encompass a larger 
region and hence include the neocentromere located 13 kb away 
from the native CEN 7 locus. To find out whether inactivation of 
the neocentromere in a gene-converted clone has occurred due to 
replacement of the neocentromere by homologous sequence dur- 
ing recombination or by an active suppression from the native 
centromere acquired from homologous chromosome, we sought 
to examine the possible length of the gene conversion tract. In 
strain J181(RM1000AH-cen7-6.5/cM), we replaced a 1.2-kb region 
(Assembly 21 CaChr7, 438440-439640) located next to the neo- 
centromere (Assembly 21 CaChr7, 440500-444000) toward CEN7 
with a dominant marker NATl (1.2 kb). Integration of the NATl 
marker on the neocentric chromosome was confirmed by South- 
ern analysis (Fig. 8A). The resulting strain J 189 is resistant to the 
drug nourseothricin. If the gene conversion tract encompasses 
the neocentromere, the NATl marker should be lost along with 
the neocentromere in its derivatives (Fig. 8B). Gene-converted 
Ura~Arg^His"^ derivatives of two independent transformants of J 189 
were examined for the presence of the NATl marker. All Ura" gene- 
converted colonies were found to be sensitive to nourseothricin 
(Fig. 8C) as well. This indicates that the neocentromere has been 
replaced by a sequence from the homologous chromosome and 
thus inactivated by the native centromere. As a result, the cen- 
tromere has been repositioned to its original place by gene con- 
version. Gene conversion can be an active mechanism to suppress 
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Figure 5. ChlP-seq analyses show that CfN proximal neocentromeres are authentic neocentromeres in Chrl and Chr5. Relative number of sequence 
reads from whole cell lysate {top panel) or CENPA-Prot A ChIP DNA (/ower panels) in J200 (wild type), J227 (RMl OOOAH-cen lA), J225, or J226 (RMl OOOAH- 
cenSA) mapped neocentromeres (nCENs) within 1 5 kb from the deleted region. (Left panels) Distribution of sequence reads across the entire chromosome; 
(right panels) zoomed-in view of the neocentromeric location along with the native CfN in each case. Chromosome coordinates are shown in the x-axis. 



neocentromeres to maintain the native centromere position con- 
served during evolution (Fig. 8D). 

Discussion 

The exclusive presence of potential neocentromere-forming 
loci at centromere proximal regions in both Candida species 



demonstrates that the physical location of a centromere, once 
established, becomes evolutionarily conserved in spite of signif- 
icant changes in the underlying sequence. A remarkably high 
conservation in relative centromere location in two independent 
clinical isolates, C. albicans 1006 and C. albicans WO-1, and in a 
related species, C. dubliniensis, strongly supports this hypothesis. 
We reason that although the native centromere is the strongest site 
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native centromere in parent chromo- 
somes (Williams et al. 1998; Nasuda et al. 
2005), suggesting that proximity to the 
native centromere favors neocentromere 
formation. In light of these observations 
and our results from comprehensive 
analysis of neocentromere formation in 
C. albicans, we propose a model shown in 
Figure 8D. 

The presence of neocentromere 
hotspots within the centromere proximal 
regions indicates that the nature of DNA 
sequences such as structural motifs or 
replication origins may play an impor- 
tant role in the assembly of CENPA on 
centromeres/neocentromeres (Sanyal 2012). 
Bending of DNA caused by centromeric 
binding proteins (CBF3 complex in 
5. cerevisiae and CENP-B in human) has 
been proposed to provide a proper ge- 
ometry for kinetochore assembly in 
yeast and human (Anderson et al. 2009). 
A comparative analysis of structural con- 
formation at functional centromeres/ 
neocentromeres may provide informa- 
tion regarding structural blueprints that 
help centromere-forming sequences to 
be part of this specialized CENP-rich three- 
dimensional (3D) scaffold described in 
the model. 

The previous report of the formation 
of neocentromeres at multiple locations 
on Chr5 in C. albicans depicts several fea- 
tures of neocentromeres: Neocentromere 
formation was seemingly efficient, ran- 
dom, and restricted primarily to inter- 
genic regions when formed away from 
the native centromere (Ketel et al. 2009). 
Considering the conserved features of 
neocentromeres in various plant, animal, 
and fungal species, the presence of a 
combination of all these properties of 
neocentromeres together in C. albicans is 
particularly striking (Marshall and Choo 
2009). Moreover, centromere location 
has been shown to be unchanged in a 
number of C. albicans clinical isolates 



for centromere formation, other potential sites {nCEN7-I-IV) are 
present at centromere proximal regions due to their similar relative 
location to that of the native centromere along the length of a 
chromosome. Interestingly, a comparative analysis (Fig. 2B) re- 
vealed that neocentromeres that are formed close to the native 
centromere (nCEN7-J) show CENPA binding comparable to that of 
the native centromeres {CENT). As the distance of a neocentromere 
site from the deleted CEN7 region increased (nCEN 7 -II and nCEN7- 
III), CENPA binding progressively dropped. Previously we have 
shown that introduction of a CEN DNA sequence close to (~7 kb 
away) a native CEN could not recruit CENPA chromatin (Baum 
et al. 2006). Neocentromere formation on a stable barley chro- 
mosome 7H derivative and Drosophila X minichromosome (7238) 
occurred on sequences that were originally situated close to the 



analyzed in a previous study in spite of 
a considerable heterogeneity in centromere DNA sequence (Mishra 
et al. 2007). Our studies on the dynamics of neocentromere for- 
mation on Chr 1, Chr 5, and Chr 7 of C. albicans as well as on Chr7 
of C. dubliniensis show some common properties reported to be 
associated with Chr5 neocentromeres. However, a total absence of 
ectopic centromere-distal neocentromeres from a large number of 
transformants analyzed in our study reveals a significant difference 
from the previous report (50% CEN5-deleted transformants formed 
neocentromeres at centromere-distal sites). Interestingly, CENPA 
enrichment was found to be very low in most of these distal neo- 
centromeres (36%-42%, 8%-14%, 36%, and 60% in NeoCEN-1, 
NeoCEN-2, NeoCEN-3, and NeoCEN-4, respectively) (Ketel et al. 
2009). A low-affinity binding of CENPA at several noncentromeric 
sites has been shown to occur in 5. cerevisiae (Lefranfois et al. 2009). 
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drive speciation through functional in- 
compatibility of centromeres in these 
two closely related asexually propagated 
yeasts. 

Methods 

Strain construction 

Strains and primers are listed in Supple- 
mental Tables S3 and S4; respectively. 



nCENJ'III 
Gene conversion 



CENPA 
CENPCI 



.IEV2 



LEV2 



CEN7 



HlSl 



A}: 



ARG4 



Ura+His+Arg+ 



CEN7/CEN74SC 



Ura-Hi8-i-Arg+ 



Figure 7. Centromere repositioning in C albicans. PCR analysis of CENPA/Cse4 and CENPCI /Mif2 
Chip DNA in wild-type RM1 OOOAH {upper), RM1 000AH-cen7-6.5/cM {middle) harboring nCEN7-lll (~1 3 
kb away from URA3 and marked by a star), and a gene-converted derivative of RM1 OOOAH-cen 7-6. 5/cM 
{bottom). Absence of CENPA and CENPCI binding at the nCEN7-iii locus after gene conversion confirms 
neocentromere inactivation. 



Co-localization of another kinetochore protein on such low- 
affinity CENPA-bound distal neocentromere sites should be done 
in order for them to be considered authentic neocentromeres. 

Despite the rapid sequence divergence, centromere locations 
in C. albicans and C. dubliniensis remain unchanged. Thus, there 
must be factors or mechanisms that are responsible for conservation 
of physical locations of centromeres while underlying sequences 
are constantly changing. Centromeric regions were thought to be 
recombination deficient, but recently, mitotic recombination has 
been shown to be taking place at mammalian centromeres at 
a very high rate (Jaco et al. 2008). Moreover, gene conversion has 
been reported to occur at centromeres of 5. cerevisiae and maize 
(Symington and Petes 1988; Shi et al. 2010). Gene conversion is 
a mechanism that repairs double-strand breaks by copying the 
homologous sequence. In the present study, we detected gene 
conversion at C. albicans centromeres. During gene conversion, 
an established neocentromere was nonreciprocally exchanged by 
its homologous sequence that originally lacked neocentromeric 
activity. This led to repositioning of CENPA from nCEN7-III to the 
native CEN7 locus. As discussed earlier, CENPA recruitment at 
neocentromeres progressively drops with an increase in distance 
from the native centromeres. Thus gene conversion might provide 
a selective advantage to a chromosome by repositioning centro- 
meric activity from the neocentromeric location (less preferred) to 
the original centromeric location (more preferred). We speculate 
that this can be a possible mechanism responsible for conservation 
of the physical chromosomal location of centromeres in C. albicans 
and C. dubliniensis. Interestingly, some of neocentromere hotspots 
in humans are sites for centromere repositioning during evolution 
(Ventura et al. 2004). Recombination coupled replication during 
gene conversion has been shown to be a highly mutagenic process 
in budding yeast (Hicks et al. 2010). Recombination is required 
to bring about the mutations necessary to drive evolution. Thus, 
while keeping the physical chromosomal location unchanged, gene 
conversion might have brought about changes in DNA sequence 
by accumulating mutations over several million years, probably to 



Construction of centromeric deletion strains 

The insert for deleting the core 4.5-kb 
CEN7 was released from pBSCR7A as de- 
scribed before (Sanyal et al. 2004). Other 
cassettes for deletion of various CEN7 
regions, 4.2-kb CENl and 7.2-kb CEN5 
sequences, were constructed as follows: 
CallRAS was cloned in pBluescriptKSII 
(-) as a NotI and PstI fragment to gener- 
ate the plasmid pBSURA3. Sequences up- 
stream of and downstream from the re- 
gion to be deleted were cloned in Sad/ 
NotI and Pstl/Xhol sites of pBSURA3. Co- 
ordinates of deleted regions, sequences 
upstream of and downstream from the 
region to be deleted that were used for the homologous re- 
combination are given in Supplemental Table SI. Primer sequences 
are available in Supplemental Table S4. Inserts containing the de- 
letion cassettes were released as Sacl/Xhol fragments and trans- 
formed into the strain RMl OOOAH. Correct integrants were con- 
firmed by Southern analysis (details of Southern analysis strategies 
are given in Supplemental Table S2). 

Expression of ProtA-tagged CENP-A in C. albicans 

The NATl sequence was amplified from the pGFPNAT plasmid 
using primers NAT-1 and NAT-2 and cloned into Smal site of 
pBluescript to give rise to pBSNAT. A Cse4 TAP cassette was am- 
plified from pCSE4TAP (Thakur and Sanyal 2012) using primer pair 
JTCse4SacII/CdMif2TAPNAT2 and cloned into end-filled SacII & Spel 
sites of pBSNAT to construct pCaCse4TAPNAT. pCaCse4TAPNAT 
was partially digested with Xhol and transformed into RMl OOOAH 
and RMlOOOAH-cenSA clones. Expression and localization of Prot 
A-tagged CENPA/Cse4 in resulting strains were verified by indirect 
immunofluorescence using anti-Prot A antibodies. CENPA/Cse4 
Prot A in all clones tested exhibited localization patterns similar to 
CENPA/Cse4 (data not shown). 

Expression ofMycMifl [CENPCI] in C albicans 

To examine the binding of Mif2/CENPC1 at various neocen- 
tromeres (nCENI-IV) we expressed MycMif2 in RMIOOOAH, 
RM1000AH-C£N7/.m43, RM1000AH-cen7-4.5/cM, RMIOOOAH- 
cen7-6.5kbA, RMIOOO AH-cen7-30kbA, RMIOOO AH-cenl-4.2kbA 
clones. A cassette containing PCX! prMycM/F2 was amplified from 
strain CAMB2 (Sanyal et al. 2004) using primers Mif2PckMycPstl-F 
and Mif2pcklSacII-R and subsequently cloned into PstI and SacII 
sites of pSF2A vector (Reuss et al. 2004) that contained NATl, the 
nourseothricin (NAT) marker as described before (Thakur and 
Sanyal 2012). The resulting plasmid was linearized using Hpal and 
introduced into various neocentromere-containing transformants 
and their gene-converted derivatives. Correct integration in each 
case was confirmed using primes MycpckMif2pstF and Mif69248 
(Supplemental Table S4). Expression and localization of Myc-tagged 
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Figure 8. Centromere repositioning occurs due to inactivation of neocentromeres by gene conversion. (A) Schematics showing a region across CEN7 
and expected size of the Swal fragment released either from the unaltered (25.5 kb) or CfN7-deleted (20.5 kb) homolog of Chr7 (/eff panel). Swal-digested 
genomic DNA plugs from the parent (RMl 000AH-cen7-6.5/cM), and three NAT! integrants were separated on a CHEF gel (EtBr stained CHEF gel is shown 
here). Probing with NAT! or URA3 sequences confirmed integration of NAT! into CfN7-deleted neocentric Chr7 in these two integrants (J1 89 and J1 90). 
(S) A table summarizes the phenotypes of the markers of the segregants. Fluctuation test analysis in two independent NAT! integrants (J189 and Jl 90) of 
J181 (RM1 000AH-cen7-6.5/cM) revealed the loss of NATl along with URA3. This result suggests that the neocentromere has been replaced along with 
URA3 by gene conversion. (C) Schematic showing possible fates of the segregants after gene conversion depending on the site of recombination initiation. 
(Red line) The observed event. (Black line) An alternative event. (Red star) Active neocentromere; (gray star) inactive neocentromere. (D) A possible 
mechanism of the maintenance of centromere function at the native physical chromosomal location. Similar to budding yeast 5. cerevisiae, centromeres 
are always clustered toward the periphery of a nucleus throughout the cell cycle in C albicans. A3D reconstructed image showing peripheral localization of 
the kinetochore cluster immunostained with anti-CENPA/Cse4 (red) and anti-CENPCI /Mif2 (green) antibodies in a nucleus (blue, DAP!) in C albicans. 
Clustering of kinetochores/centromeres is crucial for the integrity of the kinetochore ensemble as well as thefidelity of chromosome segregation in both 5. 
cerevisiae and C albicans (Anderson et al. 2009). We propose that clustering of all native centromeres into a specific 3D scaffold at the nuclear periphery 
creates a zone of high CENPA (and other kinetochore proteins including CENPC1) local concentration (pink). Potential neocentromere sites exist at 
centromere proximal regions (oval shapes of blue, brown, and violet colors) at least partly, due to their physical proximity to the CENPA-rich zone. Under 
normal conditions, the native centromere occupies the most preferred position in the centromere cluster and keeps adjacent potential neocentromere 
sites away from the CENPA-rich zone by providing steric hindrance. Removal of centromeric regions of various lengths brings centromere proximal regions 
in close proximity to the CENPA-rich zone (1). The new region now becomes a part of the CENPA-rich nuclear domain and assembles CENPA chromatin to 
form a neocentromere (2a and 2b). Gene conversion can inactivate an established neocentromere by replacing it with the homologous sequence that 
lacks the epigenetic information for CENPA assembly (3). This process is accompanied by acquisition of the native centromere from a homologous 
chromosome leading to repositioning of the centromere back to its original location. 



MycMifZ/CENPCl were verified by indirect immunofluorescence 
using anti-Myc antibodies. MycMifZ/CENPCl in all clones tested 
exhibited localization patterns similar to CENPA/Cse4 (Sup- 
plemental Figs. SIC, S3D), confirming functional expression of 
MycMifZ/CENPCl. 

Integration of the NATl marker gene to measure the length of gene 
conversion tracts 

To replace a 1.2-kb region (Assembly 21 Chr7, 438440-439640) 
with NATl marker of the same length near nCEN7-IIl upstream 
(Assembly 21 Chr7, 437700-438440, primer pair NATGCl and 
NATGC2) and downstream (Assembly 21 Chr7, 439640-440300, 
primer pair NATGC3 and NATGC4) sequences were cloned into 
SacII/Spel and Clal/SacI sites of pBSNAT, respectively. The resulting 
plasmid, named pNATGC2, was used to amplify a fragment using 
NATGCl and NATGC4 and used to transform J181 (RMIOOOAH- 
cen7-6.5kbA). Desired transformants were identified by Southern 



analysis for correct integration of NATl into the L/RA3-containing 
homolog of Chr7. In parent J181 (RM1000AH-cm7-6.5/cM), Swal 
digestion releases two fragments of different sizes, a 25.5-kb frag- 
ment from the wild type and a 20.5-kb fragment from the CEN7- 
deleted homolog of Chr 7 (Fig. 8A). CHEF gel plugs were made from 
J181 and NATl integrants using the protocol described before. 
Swal-digested genomic DNA plugs from the parent J181 and three 
NATl integrants of it were separated on a CHEF gel and transferred 
to a Zeta probe (Bio-Rad) membrane. Probing the membrane with 
NATl or URA3 sequence confirmed integration of NATl into the 
C£N7-deleted neocentric homolog in these two integrants (J189 
andJ190). 

Chromatin immunoprecipitation assay 

Chromatin immunoprecipitation assays were performed as de- 
scribed previously (Thakur and Sanyal 2011, 2012). 
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Quantitative real-time PGR analysis 

Total RNA was isolated from J151 grown in Complete-Ura and 
FOA-containing media using TRIzol Reagent from Invitrogen 
(Cat# 15596-026). cDNA was prepared using Reverse Transcrip- 
tase from Thermo Scientific (Cat# EP0441) and oligo (dT) primers 
(Sigma Cat# 04387). Real-time qPCR was performed using the 
SYBR Green supermix (BIO RAD Cat# 1708880). Reactions were 
run on the BIORAD real time machine (Bio-Rad). Three technical 
replicates were performed on RNA isolated from J151. DCt values 
were calculated from the mean of the technical replicates (Sup- 
plemental Fig. S2A). 

Real-time analysis was performed on CENPA ChlP DNA from a 
representative clone from each class of neocentromere-containing 
strain. Three biological as well as technical replicates were per- 
formed for each experiment. Primers used are listed in Supple- 
mental Table S4. The CENPA enrichment was determined by the 
percent input method. The Ct values for input were corrected for 
the dilution factor and then the percent of the input chromatin 
immunoprecipitated by the antibody was calculated as 100 x 

2(Adjusted Input Ct- IP Ct). 

Chip sequencing and analysis of data 

Enriched samples from ChlP experiments as well as the corre- 
sponding whole-cell extracts were quantified using Qubit before 
processing for Library preparation. Only samples with a minimum 
of 10 ng ds DNA (Qubit) were taken further for library preparation. 
Libraries for multiplex ChlP sequencing were constructed using 
NEXTflex ChlP-Seq Sample Preparation Kit protocol outlined in 
"Preparing Samples for ChlP Sequencing of DNA" (BIOO Scien- 
tific# IP-5 143-01). Briefly, DNA was subjected to a series of enzy- 
matic reactions that repair frayed ends and phosphorylate the 
fragments. The end repaired fragments were subjected to two 
rounds of SPRI clean up with Agencourt AMPURE XP beads 
(Beckman Coulter # A63881) for size selection of DNA inserts be- 
tween 300 and 400 bp. The size-selected fragments were ade- 
nylated with a single nucleotide "A" overhang (BIOO Scientific* 
IP-5 143-01) and adaptors were ligated (NEXT Flex adapters). The 
fragments with ligated adapters were enriched with 18 cycles of 
PCR. The prepared libraries were quantified using Qubit and 
validated for quality by running an aliquot on a High Sensitivity 
Bioanalyzer Chip (Agilent). 

Sequencing was carried out on the Illumina Genome Analyzer 
IIx (GAIIx) using flowcell (ID-63867AAXX). ChlP-seq library frag- 
ments were diluted, denatured, and hybridized to a lawn of oli- 
gonucleotides immobilized on the flow cell surface. Hybridized 
library template was amplified using immobilized oligonucleotides 
as primers. Each hybridized template through the process of iso- 
thermal bridge amplification results in the formation of a cluster 
comprised of roughly 1000 clonal copies. 

Sequencing was performed by synthesis (SBS) technology 
using four fluorescently labeled nucleotides to sequence every 
cluster on the flow cell surface in parallel. During each sequencing 
cycle, a single labeled deoxynucleotide triphosphate (dNTP) was 
added and clusters were imaged. The fluorescent dye and blocker 
was cleaved off and the next complementary base was added to the 
nucleic acid chain and imaged. Thirty-six such cycles were per- 
formed. Individual bases were called directly from signal intensity 
measurements during each cycle. Once sequencing was com- 
pleted, the raw data was extracted from the server using the pro- 
prietary Illumina pipeline software to obtain FASTQ files. Quality 
check of raw data was performed using Seq QC and 7 to 30 million 
single-end 36-nt-long reads were generated for each sample. Raw 
reads were processed using SeqQC. Reads were aligned to the target 
C. albicans SC5314 genome Assembly 21 using Bowtie version 



0.12.8 and using the parameters "-v 3-best -m V (Langmead and 
Salzberg 2012). A minimum of 2 million aligned reads were ob- 
tained per sample. Peak calling was performed using Homer v3.13 
in ''histone" mode using default parameters (Heinz et al. 2010). 
Chromosome-wise read distribution and fold-enrichment graphs 
were generated using R scripts. 

CHEF analysis 

C. albicans cells were grown in appropriate media (YPDU for 
RM 1000 AH and gene-converted strains and Complete-Ura for 
RMlOOOAH-cen/Zi) until ODgoo = 1.0. Exponentially grown cells 
were pelleted down and washed with ice chilled 50 mM EDTA. 
Agarose plugs were made according to the instruction manual 
protocol (BioRad Kit, Cat No. 170-3593) with clean cut agarose 
(0.6%) and lyticase enzyme provided in the kit. Plugs were in- 
troduced into agarose gels in 0.5 x TBE buffer (0.1 M Tris. 0.09 M 
boric acid, 0.01 M EDTA, pH 8), and electrophoresis was performed 
with a Bio-Rad CHEF-DRIII system. The gels were run in 0.5 X TBE 
buffer and maintained at 14°C throughout the procedure. Elec- 
trophoresis settings were as follows (Magee et al. 2008): 

For separation of all chromosomes: 0.6% Bio-Rad Pulsed Field 
Certified Agarose, in 0.5 X TBE buffer (50 mMTris, 50 mM boric 
acid, 1 mM EDTA, pH 8.3), 60-300 sec switch ramp, 24 h, 4.5 V/cm 
120°; 720-900 sec ramp, 12 h, 2.0 V/cm; 

For separation of smaller chromosomes: 0.9% agarose, 60-120 sec 
ramp 24 h, 6.0 V/cm; 120-360 sec ramp, 15h, 4.5 V/cm, 120°; 

For separation of Swal-digested DNA: 0.9% agarose, 0-10 sec ramp 
19 h, 6.0 V/cm; 120°. 

The gels were stained with ethidium bromide (EtBr) and analyzed 
by using the Quantity One software (Bio-Rad). 

Chromosome loss assay 

All the genetic manipulations in C. albicans were done in RM 1000 AH 
background (Sanyal et al. 2004). Each homolog of Chr7 is differ- 
entially marked with unique auxotrophic markers — ARG4 or HISl 
on the right arm. Chromosome loss in RMlOOOAH-ce^z/^, RMIOOOAH- 
cen7-6.5kbA, and ^M1000AH-cen7-30kbA transformants was scored 
by simultaneous loss of URA3 and ARG4 or URA3 and HISl 
markers. Strains were grown overnight at 30°C in nonselective 
media (YPDU) and plated on nonselective plates. After two days 
of growth at 30°C, colonies were replica-plated on selective plates 
(CM-Ura, CM-Arg, and CM-His plates). Colonies that failed to grow 
on plates lacking uridine and arginine or uridine and histidine 
were counted as colonies that lost a chromosome. A similar fluctu- 
ation test analysis was performed in NATl integrants of RMIOOOAH- 
cen7-6.5kbA, but in addition to CM-Ura, CM-Arg, and CM-His 
plates, cells were transferred to YPDU +Nat plates as well. 

Data access 

The sequencing data used in this study have been submitted to the 
NCBI Gene Expression Omnibus (GEO) (http://www.ncbi.nlm.nih. 
gov/geo/) under accession number GSE42907. 
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